Computational Grammar Induction for Linguists

نویسندگان

  • Pieter W. Adriaans
  • Menno van Zaanen
چکیده

In general a grammar describes a (possibly infinite) set of sentences with a finite structural description. Computational Grammar Induction (CGI) deals with the creation of computational models for identification of these infinite sets on the basis of a finite set of examples. CGI is a field in its own right, with its own internal research questions, many of which have no direct impact on the study of human language. Yet it is clear that computational models created by the CGI community might be of interest to the linguistic community because human language after all appears to be an infinite set, the description of which is learned efficiently in a relative short time. There are various domains in which learnability of a language might be interesting for linguists: e.g., first language acquisition, second language acquisition, or the automatic extraction of grammars from corpora. In this article we will focus on first language acquisition with some suggestions for extensions to other areas. with some suggestions for extensions to other areas. Up till now there has been surprisingly little cross-fertilization between the field of linguistics and that of Computational Grammar Induction. There are various reasons for this situation, for example, there is a strong interest in the descriptive study of language in linguistics on the one hand and a strong focus on abstract language models in the CGI community on the other hand. A factor that has certainly contributed this lack of cross-fertilization is the famous conjecture of Chomsky that the efficiency of human language acquisition can only be explained on the basis of an innate Universal Grammar (UG). The UG discussion, which is in fact a revival of the ancient philosophical debate of rationalism (Descartes)

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The use of corpora for automatic evaluation of grammar inference systems School of Computing University of Leeds Leeds LS 2 9 JT United Kingdom

The evaluation of grammar inference systems is clearly a non-trivial task, as it is possible to have more than one correct grammar for a given language. The 'looks good to me' approach, carried out by computational linguists analysing their own grammar inference system results, has prevailed for many years. This paper explores why this method has been so popular, in terms of its strengths, and ...

متن کامل

The use of corpora for automatic evaluation of grammar inference systems

The evaluation of grammar inference systems is clearly a non-trivial task, as it is possible to have more than one correct grammar for a given language. The ‘looks good to me’ approach, carried out by computational linguists analysing their own grammar inference system results, has prevailed for many years. This paper explores why this method has been so popular, in terms of its strengths, and ...

متن کامل

Syntactic islands and learning biases: Combining experimental syntax and computational modeling to investigate the language acquisition problem

2 Abstract The induction problems facing language learners have played a central role in debates about the types of learning biases that exist in the human brain. Many linguists have argued that some of the learning biases necessary to solve these language induction problems must be both innate and language-specific (i.e., the Universal Grammar (UG) hypothesis). Though there have been several r...

متن کامل

Platform for Full-Syntax Grammar Development Using Meta-grammar Constructs

This paper describes a combination of tools necessary for full or deep syntactic parsing of natural language – the syntactic parser synt, the graphical Grammar Development Workbench, GDW and the VerbaLex verb valency lexicon tools. We describe the development of the mentioned tools and how they integrate into one system that allows a team of experts (computational linguists as well as programme...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Grammars

دوره 7  شماره 

صفحات  -

تاریخ انتشار 2004